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Abstract 

Background: Leukocyte infiltration plays an important role in the pathogenesis and progression of myositis, and is 
highly associated with disease severity. Currently, there is a lack of: efficacious therapies for myositis; understanding 
of the molecular features important for disease pathogenesis; and potential molecular biomarkers for characterizing 
inflammatory myopathies to aid in clinical development. 

Methods: In this study, we developed a simple model and predicted that 1) leukocyte-specific transcripts 
(including both protein-coding transcripts and microRNAs) should be coherently overexpressed in myositis muscle 
and 2) the level of over-expression of these transcripts should be correlated with leukocyte infiltration. We applied 
this model to assess immune cell infiltration in myositis by examining mRNA and microRNA (miRNA) expression 
profiles in muscle biopsies from 31 myositis patients and 5 normal controls. 

Results: Several gene signatures, including a leukocyte index, type 1 interferon (IFN), MHC class I, and 
immunoglobulin signature, were developed to characterize myositis patients at the molecular level. The leukocyte 
index, consisting of genes predominantly associated with immune function, displayed strong concordance with 
pathological assessment of immune cell infiltration. This leukocyte index was subsequently utilized to differentiate 
transcriptional changes due to leukocyte infiltration from other alterations in myositis muscle. Results from this 
differentiation revealed biologically relevant differences in the relationship between the type 1 IFN pathway, 
miR-146a, and leukocyte infiltration within various myositis subtypes. 

Conclusions: Results indicate that a likely interaction between miR-146a expression and the type 1 IFN pathway is 
confounded by the level of leukocyte infiltration into muscle tissue. Although the role of miR-146a in myositis 
remains uncertain, our results highlight the potential benefit of deconvoluting the source of transcriptional changes 
in myositis muscle or other heterogeneous tissue samples. Taken together, the leukocyte index and other gene 
signatures developed in this study may be potential molecular biomarkers to help to further characterize 
inflammatory myopathies and aid in clinical development. These hypotheses need to be confirmed in separate and 
sufficiently powered clinical trials. 
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Background 

Myositis is characterized clinically by skeletal muscle weak- 
ness and histopathologically by the presence of inflamma- 
tory cells in muscle tissue. There are several major 
subclasses of myositis, including dermatomyositis (DM), 
polymyositis (PM), inclusion body myositis (IBM), and im- 
mune mediated necrotizing myopathy (NM). The leukocyte 
infiltration present in myositis muscle is believed to con- 
tribute to disease pathogenesis [1-5]. The types of immune 
cells present in myositis muscle were originally identified in 
the 1980s as predominantly CD4+ T cells and B cells in 
DM, and CD4+ and CD8+ T cells in IBM [1,6-8], with 
more recent identification of plasmacytoid dendritic cells in 
DM [9], myeloid dendritic cells in PM and IBM [3], and 
plasma cells in all three disorders [2]. Unlike DM, PM or 
IBM, NM is characterized by myofiber necrosis associated 
with macrophages and minimal T cell infiltration or MHC 
Class I expression [10]. Given the differences in clinical 
manifestations between these subtypes of myositis and the 
lack of optimal efficacious therapies for these diseases, 
understanding the molecular characteristics underlying 
their subtypes may facilitate the development of novel ther- 
apeutics that could benefit patients with myositis. 

Technologies such as whole genome microarray have 
advanced our understanding of the disease pathogenesis of 
myositis [11-13]. A large number of type 1 interferon- 
stimulated genes (ISGs) were identified to be strongly over- 
expressed in DM muscle and this molecular signature has 
been further confirmed by recent studies [4,14]. The activa- 
tion of type 1 interferon (IFN) signaling has been observed 
in many autoimmune diseases [15,16], such as systemic 
lupus erythematosus (SLE) [17-19], systemic sclerosis [20], 
rheumatoid arthritis [21], and psoriasis [22]. A type 1 IFN 
signature is not only present in DM muscle but also 
expressed in DM skin [23], as well as peripheral blood of 
DM and PM, reflecting disease activity [24-26]. Based on 
the accumulating evidence from recent microarray studies 
and other complementary experiments, disease models 
have been proposed to emphasize the central role of type 1 
IFN pathway activation in the pathogenesis of DM, suggest- 
ing that blockade of type 1 IFN might provide clinical bene- 
fit to DM patients [27]. 

In addition to previous studies focused on altered mRNA 
expression in myositis, the roles of microRNAs (miRNAs) 
in regulating immune responses, muscle development, and 
regeneration are also emerging [28-31]. miRNAs including 
miR-146a, miR-155, and miR-101 have been shown to be 
aberrantly expressed in rheumatic diseases [32-35] and 
miR-1, miR-133a/b, and miR-206 have been identified as 
muscle-specific miRNAs critical for muscle development 
and function [28-31]. Though the role of miRNAs in the 
pathogenesis of myositis has yet to be evaluated exten- 
sively, it is worth noting that interactions between miRNAs 
and type 1 IFN have been identified [36]. Specifically, miR- 



146a suppresses the innate immune response not only via 
the TLR-mediated NFkB pathway [37], but also negatively 
regulates the type 1 IFN pathway in SLE by targeting 
STAT1 and IFN regulatory factor 5 (IRF5) [38]. 

Despite the aforementioned studies, there still lacks a 
clear understanding of the disease pathogenesis that under- 
lies myositis. Meanwhile, few molecular biomarkers have 
been identified to aid in stratifying myositis patients or ob- 
jectively quantifying the leukocyte abundance in inflamma- 
tory muscle and the corresponding muscle fiber damage. 
To address the unmet needs in these areas, we performed 
both genome-wide mRNA and miRNA expression profil- 
ing in muscle biopsies from myositis patients and normal 
controls. Our studies revealed gene expression signatures 
specific for myositis and distinct for each subclass of myo- 
sitis, as well as multiple pairs of mRNA:miRNA displaying 
anti-correlation expression patterns in line with predicted 
relationships. Additionally, expression data from this study 
indicated that miR-146a displayed a positive correlation 
with the type 1 IFN signature rather than the expected 
negative correlation in myositis muscle. We postulated that 
such positive correlation could be driven by infiltrated 
leukocytes; therefore, we developed an invasion model to 
account for transcriptional changes due to leukocyte infil- 
tration. Further analyses with this invasion model indicated 
that the source of ISG expression may differ between sub- 
types of myositis, such that in PM and IBM, ISG expres- 
sion is associated with infiltrated leukocytes, whereas in 
DM, non-leukocyte cells (e.g., muscle cells) might contrib- 
ute significantly to ISG expression. Collectively, our results 
revealed multiple gene expression signatures that can po- 
tentially advance our understanding of the pathologic char- 
acteristics of myositis and provide utility as molecular 
biomarkers for identifying the right therapeutics for myo- 
sitis patients. 

Results 

Altered mRNA and miRNA expression reveals 
upregulation of immune pathways and downregulation 
of muscle contraction pathways in myositis 

A total of 837 probesets that represent 606 unique genes 
were differentially expressed by two-group £-tests in muscle 
specimens of myositis patients compared with non- 
neuromuscular disease patients (Additional file 1: Table 
SI). Most differentially expressed transcripts (88%, 740 of 
837) were over-expressed, with only 97 (12%) down-regu- 
lated. Functional analysis using IPA (Ingenuity Pathway 
Analysis, http://www.ingenuity.com/) and gene set enrich- 
ment showed that the most upregulated genes are involved 
in the immune and inflammatory responses, including a 
large number of ISGs, MHC class I/II proteins, and genes 
associated with natural killer cell mediated cytotoxicity. In 
contrast, muscle contraction and calcium signaling 
pathway-related genes are enriched in the down-regulated 
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transcripts (p-value < 0.001 and ^-value = 0.001, respect- 
ively). In addition to the identification of differentially 
expressed transcripts in myositis compared to normal con- 
trols, we also identified subclass-specific differentially 
expressed mRNAs by comparing each myositis subtype to 
the normal controls (Additional file 2: Supplementary Text; 
Additional file 2: Figure S2; Additional file 3: Table S2). 

A similar statistical analysis to determine differentially 
expressed mRNAs was applied to miRNA expression 
data to identify differentially expressed miRNAs. Of 69 
differentially expressed miRNAs identified when com- 
paring myositis muscle with normal controls, only five 
were over-expressed (miR-146a/b, miR-155, miR-21 and 
miR-432). The remaining 64 miRNAs were down- 
regulated, including miR-133a/b, miR-1, and miR-29c 
(Additional file 4: Table S3). Notably, both miR-146a 
and miR-155 are predominantly expressed in leukocytes 
and also play important roles in regulating immune 
responses [32-35]. 

Unsupervised hierarchical clustering of the 837 differ- 
entially regulated mRNA transcripts revealed two main 
transcriptionally-defined subcategories, with all normal 
and NM samples clustered within one branch (Figure 1, 
branch 1), and all IBM samples clustered within the 
other branch (Figure 1, branch 2). DM and PM samples 
are divided into both branches. Although unavailable for 
the patients within this study, additional information 
such as myositis-specific antibodies and other serology 
data could have contributed to a more optimal pheno- 
typing of the myositis specimens. The lack of these data 
may explain some of the heterogeneity observed within 
the groups defined strictly by the diagnostic criteria NM, 
DM, PM, and IBM. Nevertheless, our results suggest 
that specific molecular expression patterns, in addition 
to others previously defined for myositis, might provide 
insight into classifying and understanding the patho- 
physiology of these different subtypes of myositis. 

Consensus clustering reveals five gene signatures that 
characterize myositis 

In addition to identifying molecular expression patterns 
specific for myositis as a whole and distinct for its indi- 
vidual subtypes, we also applied and merged multiple 
clustering algorithms to identify robust clusters of genes 
that may provide unique insight into the molecular com- 
ponents of myositis (see Methods for the details). To 
rule out uninformative genes, we selected the top 200 
genes with the most varied expression from the 606 dif- 
ferentially expressed genes, among which, only three 
were down-regulated. Therefore, we focused on the 
remaining 197 up-regulated genes and clustering pro- 
duced five clusters with robust gene members (i.e., clus- 
ter A-E; Additional file 2: Figure S2C and S3; also see 
Methods). 



Subsequent functional analyses of the five clusters 
revealed that clusters A and B were enriched with type 1 
ISGs and immunoglobulins, respectively; clusters C and D 
had very similar profiles (Figure 2 A) and were enriched 
with immune response-related genes (Additional file 5: 
Table S4); and cluster E was dominated by MHC class I 
genes. These observations led us to define five gene signa- 
tures enriched in myositis muscle tissue: type 1 IFN signa- 
ture (from cluster A), immunoglobulin signature (from 
cluster B), leukocyte I and II indices (derived from clusters 
C and D, respectively) and MHC class I signature (from 
cluster E). The similarity in cluster results shared by the 
different clustering methods confirmed the scientific rigor 
of this analysis (Additional file 5: Table S4). We then pro- 
ceeded to analyze disease heterogeneity within each myo- 
sitis subclass by utilizing these five gene signatures 
(Figure 2B-C). In strong agreement with results shown in 
Figure 1, NM displayed homogeneous gene signature pro- 
files similar to normal muscle, with a slightly elevated 
leukocyte II signature observed (see Figure 2B-C). IBM 
exhibited clear over-expression of all five gene signatures. 
Gene signatures in PM and DM varied from normal-like 
signatures to patterns similar to IBM. Most myositis 
patients showed coordinate overexpression across all 
modules (shown as an equilateral right pentagon in the 
radial plot; Figure 2B), but DM showed some hetero- 
geneity of relative overexpression of the type 1 IFN 
module (i.e., cluster A). 

Both leukocyte index I and II are comprised of over- 
expressed genes related to immune function (Additional 
file 5: Table S4); displayed differences in magnitude be- 
tween myositis patients; and are highly correlated with 
each other (Spearman rank test; r = 0.97, p < 0.001; see 
Figure 2A). Therefore, we selected leukocyte index I 
(generated from cluster C) as the leukocyte infiltration 
index for subsequent analyses examining altered gene 
expression due to increased immune cell infiltration into 
muscle. 

A leukocyte index distinguishes altered expression of 
transcripts due to immune cell infiltration and correlates 
with histopathology results 

In myositis, leukocyte infiltration and/or clonal expan- 
sion results in a marked increase in the abundance of 
leukocytes in muscle. Consequently, the interpretation 
of gene expression studies in myositis tissue biopsies 
can be greatly influenced by immune cell infiltration. 
In a principal components analysis of the 837 differ- 
entially expressed transcripts in this study, we found 
that the first principal component (PCI) accounted 
for 75.6% of the variance in the data. PCI had a sig- 
nificant linear association with our proposed 
leukocyte index (7? 2 =0.92, j9<0.001), suggesting that 
the majority of the overall expression variation was due to 
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Figure 1 Heatmap of 837 differentially expressed transcripts. Transcript expression fold change is represented by the color scale increasing 
from blue to white to red; the color key is shown in the top left corner. The 36 subjects are shown horizontally and represented by the color bar 
on the top of the heatmap. The color that corresponds to each disease type is displayed in the upper right corner. At the first bifurcation of the 
horizontal dendrogram, the subjects are classified into 2 groups from left to right: group 1 and group 2. 



the change of leukocyte cell numbers and could be 
accounted for by this index. This computational represen- 
tation of leukocyte infiltration could eliminate the tran- 
scriptional variation generated solely from the increased 
immune cell proportion in muscle samples and allow gene 
expression changes to be attributed to the appropriate 
source. 



The ability of the leukocyte index to quantify the extent 
of leukocyte infiltration is indirectly supported by the sig- 
nificant enrichment of immune response-related tran- 
scripts among the differentially expressed transcripts 
(hypergeometric test, p< 0.001), as well as the robust clus- 
tering of these genes. Additionally, there is a significant 
linear correlation between the leukocyte index and 
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Figure 2 Gene signatures derived from clusters A-E from the consensus cluster. (A) Pairwise scatter plots of the five gene signatures. The 
scatter plots of the signatures of the clusters A-E are displayed in the lower panels, where data points are colored by the sample type (same as 
the Figure 1) and red lines represent the lowess (Locally Weighted Scatterplot Smoothing) lines. The corresponding spearman correlation 
coefficients are displayed in the upper panels, with significance levels indicated by the stars (*** p < 0.001, ** p < 0.01, * p < 0.05). (B) Subclasses 
of myositis characterized by the five gene signatures are shown in the star plots. Each subject is represented with a specific pentagon in the star 
plot and the vertices of one pentagon along the spokes (A-E) represent enrichment of the signature scores from clusters A-E. Colors of 
pentagons are randomly assigned to distinguish subjects and star plots are grouped by the myositis subclasses. (C) Subjects can be further 
characterized by the five gene signatures using a heatmap. The signature scores are represented by the rainbow color in the heatmap, ranging 
between 0 (colored in blue) and 4 (colored in red); the color key for each sample's disease type is displayed horizontally between the sample 
identifiers and the heatmap, where, (from left to right), green denotes normal, gray for NM, blue for DM, magenta for PM and red for IBM. The 
four DM patients with perifascicular atrophy (PFA) are marked by the star symbol. 



immune cell infiltration identified and scored by 
hematoxylin and eosin (H&E) staining (7^=0.79, p<0Wl; 
Figure 3). This provides direct evidence to support the 
leukocyte infiltration index described here by confirming 
that the index accurately reflects leukocyte abundance in 
muscle. 

In addition to this ability of the leukocyte index to cor- 
rectly reflect the abundance of immune cells in muscle 
tissue, it may also inversely correlate with the expression 
of muscle-specific mRNA or miRNA due to either a 
decreased proportion of muscle cells in the sample or 
muscle damage. For instance, TTN is a structural 
protein abundant in myofibers whose deficiency was 
found to be associated with DM perifascicular atrophy 



(DM-PFA) [14]; miR-1 is a muscle specific miRNA 
[39] that has an important role in muscle develop- 
ment [28-31], Both TTN and miR-1 exhibited a 
marked anti-correlation with the leukocyte index 
(Spearman rank test; r = -0.70, /?<0.001 and r = -0.64, 
p< 0.001, respectively), providing additional evidence to 
support the validity of our invasion model and the 
derived leukocyte index. 

Multiple mRNAs dysregulated in myositis muscle are not 
due to leukocyte infiltration 

The development of the leukocyte infiltration index 
revealed that most of the over-expressed transcripts in 
myositis muscle compared to normal muscle specimens 
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H&E Staining 

Figure 3 Correlation between leukocyte infiltration index and 
H&E staining. A significant linear correlation is displayed in the 
scatter plot between the leukocyte infiltration index and the degree 
of inflammation assessed by H&E staining (0=no inflammation, 
1=sparse inflammation, 2=mild inflammation, 3=moderate 
inflammation, and 4=severe inflammation). Data points are colored 
by the sample type; color key is displayed in the upper right corner 
of the plot. The regression line is denoted by the blue dotted line 
(intercept=0.07, slope=1.00; /? 2 =0.79, p<0.001). 



were not truly up-regulated due to transcript over- 
expression, but rather these expression changes were 
actually due to leukocyte cells infiltrating the muscle tis- 
sue. Accordingly, we adjusted the expression data matrix 
using the infiltration index to identify over-expressed 
transcripts unaccounted for by leukocyte infiltration. 
Consequently, only 41 (5.5%) out of 740 over-expressed 
mRNAs remained over-expressed after adjustment with 
the infiltration index (Additional file 6: Table S5). These 
miRNAs included many immunoglobulins (IGK, IGH, 
IGL), chemokines (CCL8, CCL11, CCL18), myosin com- 
plex genes, and cytoskeletal genes (POSTN, TNNT2 and 
MYH8). The dramatic reduction of over-expressed tran- 
scripts after adjusting for leukocyte infiltration con- 
firmed that most of the over-expression is driven by 
increased leukocyte abundance in myositis muscle. 

Subclass-specific transcript analysis following adjust- 
ment for leukocyte infiltration (Additional file 7: Table 
S6) revealed that immunoglobulin proteins and some 
chemokines were over-expressed in both IBM and PM, 
whereas ISGs were heavily over-expressed in DM 
muscle. Overall, our results suggested that myofibril and 
chemokine signatures exist in DM, PM and IBM, and 
likely resulted from changes in gene expression in the 
muscle cells themselves rather than alterations due to 
leukocyte infiltration. Understanding the contribution of 
these pathways altered in muscle tissue may provide 
keen insight into the pathogenesis of specific subclasses 
of myositis. 



The leukocyte index and the type 1 IFN signature in 
myositis muscle 

After re-assessing gene expression changes to account for 
leukocyte infiltration, we observed that ISGs remained 
highly over-expressed in muscle biopsies particularly from 
DM patients. Accordingly, we investigated the relation- 
ship between over-expression of ISGs and the leukocyte 
index in more depth. Strikingly, the type 1 IFN signature 
showed a significant linear correlation with the leukocyte 
index with exception of four outliers (i.e., the DM sub- 
jects GEIM 12, 272, 382 and 383; see Figure 4). The linear 
correlation suggests that leukocyte infiltration might have 
a more pronounced effect on ISG over-expression in IBM, 
PM, and DM without perifascicular atrophy (PFA; see 
Table 1); whereas in a subset of DM patients with 
marked ISG over-expression, especially those with PFA, 
other cell types (e.g., muscle fiber cells) might contribute 
to type 1 IFN pathway activation. 

Observations that a correlation between ISG over- 
expression and leukocyte infiltration varied between myo- 
sitis subtypes prompted us to also investigate the correl- 
ation of miR-146a with the leukocyte index as this 
miRNA has been previously reported to negatively regu- 
late the type 1 IFN pathway in SLE [38]. Our results indi- 
cated that both miR-146a and the type 1 IFN signature 
display an overall positive correlation with the leukocyte 
index (Spearman rank test; r = 0.64, p < 0.001; r = 0.72, 
p < 0.001, respectively; also see Figure 4), and therefore 
they are likely to be positively correlated with each other 
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Figure 4 Correlation between type 1 IFN signature score and 
leukocyte infiltration index in myositis. A significant linear 
correlation is displayed in the scatter plot between type 1 IFN index 
and leukocyte infiltration index. Data points are colored by the 
sample type; color key is displayed in the upper right corner of the 
plot. The regression line (/? 2 =0.41, p<0.001) is denoted by the black 
dotted line and 95% confidence interval is outlined by the red 
dotted line and gray shaded background. The subject identifiers of 
the four outliers are labeled. When excluding the four outliers, the 
/?-squared value in the regression analysis escalates from 0.41 to 0.91. 
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Table 1 Patient information 



Subject ID 


Age 


Gender 


Disease type* Muscle 


Disease duration (months) 


Treatments 


GEIM12 


74 


F 


DM Biceps 


1 


None 


GEIM54 


62 


M 


PM Quadriceps 


24 


None 


GEIM57 


38 


F 


Normal Biceps 


N/A 


N/A 


GEIM63 


37 


F 


Normal Biceps 


N/A 


N/A 


GEIM64 


77 


M 


NM Biceps 


6 


Pred 


GEIM90 


90 


M 


PM Deltoid 


3 


None 


GEIM96 


86 


F 


PM Biceps 


3 


None 


GEIM127 


68 


F 


IBM Biceps 


36 


Azathioprine 


GEIM133 


60 


F 


Normal Quadriceps 


N/A 


N/A 


GEIM174 


36 


F 


PM Biceps 


3 


None 


GEIM272 


25 


F 


DM (with PFA) Biceps 


3 


Pred x 4 days 


GEIM306 


64 


F 


NM Biceps 


17 


Pred 


GEIM312 


76 


F 


PM Biceps 


96 


None 


GEIM315 


39 


F 


DM Biceps 


3 


None 


GEIM322 


58 


F 


PM Biceps 


180 


None 


GEIM331 


87 


F 


PM Deltoid 


36 


Pred 


GEIM335 


71 


F 


IBM Quadriceps 


24 


None 


GEIM340 


83 


F 


NM Biceps 


66 


Pred 


GEIM349 


28 


F 


DM Biceps 


18 


Pred, Azathioprine 


GEIM354 


76 


M 


IBM Biceps 


60 


None 


GEIM363 


79 


F 


IBM Biceps 


36 


None 


GEIM365 


41 


M 


Normal Biceps 


N/A 


N/A 


GEIM368 


61 


F 


Normal Quadriceps 


N/A 


N/A 


GEIM372 


61 


M 


IBM Quadriceps 


84 


Dexamethasone taper 


GEIM373 


80 


F 


IBM Quadriceps 


90 


None 


GEIM374 


47 


F 


IBM Quadriceps 


54 


None 


GEIM375 


50 


F 


NM Quadriceps 


6 


None 


GEIM376 


37 


M 


DM Deltoid 


4 


None 


GEIM377 


50 


M 


IBM Biceps 


24 


None 


GEIM380 


61 


M 


IBM Quadriceps 


24 


None 


GEIM382 


30 


F 


DM (with PFA) Deltoid 


1 


None 


GEIM383 


44 


F 


DM (with PFA) Deltoid 


2 


Pred x 1 month 


GEIM385 


61 


F 


PM Biceps 


60 


Etancercept 


GEIM389 


40 


M 


DM (with PFA) Biceps 


5 


N/A 


GEIM391 


71 


M 


NM Quadriceps 


7 


None 


GEIM397 


77 


M 


IBM Biceps 


84 


None 



* DM, dermatomyosits; PM, polymyositis; IBM, inclusion body myositis; PFA, perifascicular atrophy. 



as well Upon further evaluation, this positive pattern of as- 
sociation was evident primarily in IBM and PM (Figure 5), 
where leukocyte infiltration may be responsible for the 
observed alterations in miR-146a expression. In contrast, 
6 out of 8 DM samples displayed a negative pattern of as- 
sociation between the expression level of miR-146a and 
the type 1 IFN signature, similar to what has been shown 
previously in SLE [38]. 



To examine the possibility that miR-146a could nega- 
tively regulate the type 1 IFN signature in muscle tissue, 
we utilized an in vitro muscle cell model. The level of 
miR-146a expression was altered in C2C12 muscle cells 
by transfecting either miR-146a mimics or miR-146a 
inhibitors into this cell line prior to stimulation with 
type 1 IFN. Measuring the resulting expression levels of 
ISGs illustrated that miR-146a is able to negatively 



Zhu et al. BMC Medical Genomics 201 2, 5:53 
http://www.biomedcentral.eom/1755-8794/5/53 



Page 8 of 12 



M 0 
M 0) 



/ GEJM272 




!• GEIM382 N N s 




> • 

G$IM12 


GEIM3-15 • 


GEIM383 


/ $ 
• 

7 \ 






GEIM3'4 9 


GEIM389 


/ # • 


X • 

GEI-W376 



Sample Key- 
Normal 
NM 

• DM 

• PM 

• IBM 



Expression of miR-146a 

Figure 5 Intricate association between the expression of miR- 
146a and type 1 IFN signature score in a subclass of myositis. 

Data points are colored by the sample type in this scatter plot; color 
key is displayed in the upper right corner of the plot. The identity 
reference line y=x is denoted by the blue dotted line. Type 1 IFN 
signature displays a positive pattern of association in most subjects 
including the normal control, NM, PM, IBM and some DM, as 
highlighted in the light blue shaded oval. On the other hand, the 
type 1 IFN signature displays a negative pattern of association with 
the expression level of miR-146a in most DM (highlighted in the 
pink shaded oval). The subject identifiers of the eight DM subjects 
are all labeled in this plot. 



regulate the type 1 IFN pathway in muscle cells 
(Figure 6), as demonstrated by the increased ISG expres- 
sion following inhibition of miR-146a and the decreased 
ISG expression following increased miR-146a levels. 
These trends and degree of effect on ISGs by miR-146a 
are in agreement with what has been reported previously 
in SLE [38], and suggest a possible role for miR-146a in 
regulating the type 1 IFN pathway in muscle, although 
this remains to be verified by in vivo studies. 



Discussion 

Most biological tissue specimens are heterogeneous, often 
consisting of multiple cell types. This heterogeneity is evi- 
dent in myositis muscle biopsies and confounds transcrip- 
tional analyses, as the source of an observed expression 
change is unclear. We developed a leukocyte index to dif- 
ferentiate the transcriptional alterations due to an increased 
proportion of inflammatory cells from other changes at the 
expression level. The positive correlation of this computa- 
tional leukocyte index with immune cell infiltration scored 
from H&E staining and the expression of lymphocyte- 
specific mRNAs/miRNAs validated our invasion model and 
further confirmed the use of this index as a surrogate for 
leukocyte abundance in muscle. The ability to distinguish 
between muscle-derived transcripts and genes over- 
expressed due to leukocyte infiltration may improve the 



interpretation of gene expression results and increase exist- 
ing knowledge of molecular pathways critical for myositis. 

In addition to the leukocyte index, we have also identi- 
fied several other gene signatures (each from a consen- 
sus cluster with coherently expressed gene members): 
MHC class I, type 1 IFN, and immunoglobulin signa- 
tures. The type 1 IFN, MHC class I, and immunoglobu- 
lin signatures have been reported previously in various 
subclasses of myositis, in good agreement with the 
results in this study. Also, the current study is the first 
to systematically evaluate these genomic signatures to- 
gether, which could provide important insight into the 
pathogenesis and potential therapeutic targets for myo- 
sitis. Several transcripts displaying a good correlation 
with the leukocyte index and/or the type 1 IFN signature 
in muscle were also identified in this study, such as 
TTN, miR-1, miR-155 and amyloid beta (A4) precursor 
protein-binding, member B and member 1 interacting 
protein (APBB1IP). However, the small sample size and 
insufficient clinical information preclude us from evalu- 
ating the association of these molecular characteristics 
with the disease activity/ severity within this study. Add- 
itionally, there was not adequate material to conduct 
extensive immunohistochemistry studies on patients in 
this study to correlate the presence of immunoglobulin 
and MHC class I protein levels with the corresponding 
gene signature scores. Continued effort to evaluate these 
molecular features in clinical studies where detailed clin- 
ical/demographic information is available will allow for 
further characterization of the utility of these potential 
biomarkers to aid in the patient stratification or to pre- 
dict therapeutic efficacy. 

In contrast to the positive correlation identified be- 
tween the type 1 IFN signature and leukocyte infiltration 
index in PM and IBM, a positive association was not 
observed in most DM samples. One possible explanation 
is that the type 1 IFN signature in PM and IBM is 
mainly driven by leukocyte infiltration into muscle, while 
distinct drivers might be present in DM. Our results 
support the previous finding [14] that the type 1 IFN sig- 
nature in DM might be connected with PFA (Figure 4), 
although a larger sample size is desired. Notably, this 
association between PFA and the type 1 IFN signa- 
ture was also accompanied by the presence of the 
MHC class I signature (see Figure 2B, C). Additional 
studies need to be carried out in the future to confirm 
these observations. 

The ability to distinguish the role of different cell types 
at disease sites is important in identifying anti-correlations 
between miRNA and mRNA expression, such as the ex- 
pression of miR-29c and collagen genes (Additional file 2: 
Supplementary Text and Supplementary Methods) and 
other anti-correlations that may be important to the patho- 
genesis of myositis. Anti-correlations comparing mRNA 
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Figure 6 miR-146a alters the expression of type 1 IFN inducible genes in C2C12 muscle cells. C2C12 cells were transfected with miR-146a 
mimics and miR-146a inhibitors, as well as the appropriate scrambled controls. 24 hours after transfection, cells were stimulated with mouse IFN- 
alpha for 4 hours before RNA was isolated and profiled for alterations in ISGs. Regulation of the IFN pathway was demonstrated by increased ISG 
transcript levels following inhibition of miR-146a, as well as decreased ISG transcript levels following addition of miR-146a. Values are shown as 
mean ± SEM and represent results from 3 independent experiments. 



and miRNA expression levels could also be complicated by 
the effect of increased leukocyte cell abundance, potentially 
masking true correlations between mRNAs/miRNAs in 
various disease settings. The positive correlation identified 
between miR-146a and the type 1 IFN signature in PM 
and IBM is one example where the increased number of 
leukocytes in muscle influenced the resulting gene expres- 
sion comparison between myositis patients and normal 
subjects. Taken together, these results strongly support the 
application of the leukocyte infiltration index to transcrip- 
tomic studies to appropriately characterize altered mRNA 
and/or miRNA expression from heterogeneous biological 
samples, thus allowing for more meaningful interpretation 
of microarray data with the aim of increasing our under- 
standing of the fundamental mechanisms of disease 
pathogenesis. 

Evaluation of the effect of miR-146a in the suppression 
of the type 1 IFN pathway in myositis muscle revealed 
an interaction that was confounded by leukocyte infiltra- 
tion. Nevertheless, our results indicated that a subset of 
DM patients had high type 1 IFN signatures and low 
miR-146a expression in muscle. Additional in vitro 
results showed that altered miR-146a levels in muscle 
cells could regulate the activation of the type 1 IFN 
pathway. Due to the small number of myositis samples 
displaying an anti-correlation between miR-146a and the 
type 1 IFN pathway and the lack of in vivo studies 



confirming this relationship, the exact functional role of 
miR-146a in myositis remains uncertain. However, con- 
tinued work on characterizing this relationship in myo- 
sitis is warranted. 

Conclusions 

In this study comparing transcriptional changes between 
myositis muscle biopsies and normal controls, we identi- 
fied several gene signatures, including a leukocyte index 
and a type 1 IFN signature, as genomic biomarkers to 
characterize myositis subjects at the molecular level. 
Additionally, use of the leukocyte index to account for 
transcriptional changes due to increased lymphocyte in- 
filtration into muscle revealed that the majority of these 
changes are driven by increased leukocyte abundance. 
Investigation into the relationship between leukocyte in- 
filtration and a type 1 IFN signature suggested that 
increased ISGs might have two distinct sources at the 
myositis disease site: one is closely associated with 
leukocyte infiltration in IBM, PM and some DM; and 
the other source is not certain, but may be due to MHC 
class I-expressing myofibers, especially in some DM 
patients. Further understanding of the relationship be- 
tween these signatures in vivo, as well as any association 
with myositis disease severity or activity will be explored 
in future clinical studies to evaluate their utilities as bio- 
markers. The ability to differentiate the transcriptional 
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alterations due to an increased proportion of inflamma- 
tory cells from other changes at the expression level 
increases our ability to connect changes in specific cellu- 
lar subsets with important biological phenomena critical 
for disease development or progression. 

Methods 

Patients and tissue samples 

Muscle samples from 36 patients (5 normal controls, 
5 NM, 8 DM, 8 PM and 10 IBM) were studied (Table 1). 
Normal controls are those subjects who were not sus- 
pected clinically to have neuromuscular disease; had 
normal muscle strength by examination; and showed 
normal serum CK levels. Diagnostic criteria for IBM, 
DM, and PM were as previously described [11]. Patients 
with NM had an acute or subacute myopathy responsive 
to immunotherapy, with pathology showing necrotic 
muscle fibers without inflammatory cells other than 
macrophages. Myositis-specific antibodies may also be 
highly predictive of the clinical pheno-subtype of myo- 
sitis; however, these data were not available for the 
patients utilized in this study and could not be incorpo- 
rated into the overall phenotyping of the samples. 

Open muscle biopsies were performed at the time of a 
diagnostic biopsy (the biopsy site is listed in Table 1) 
and immediately frozen in liquid nitrogen for RNA and 
miRNA. A separate piece of muscle was used for 
hematoxylin and eosin (H&E) staining. Patients provided 
informed consent for research and institutional review 
boards approved all studies. 

Hematoxylin and eosin (H&E) staining and assessment 

H&E stained frozen sections were examined microscopic- 
ally by a single examiner blinded to diagnosis. The degree 
of inflammation graded between 0-4 (0=no inflammation, 
l=sparse inflammation, 2= mild inflammation, 3=moderate 
inflammation, and 4=severe inflammation). 



Total RNA extraction 

miRNA-RNA was extracted from muscle biopsies using the 
mirVana miRNA Isolation kit (Applied Biosystems/Ambion, 
Austin, TX), according to manufacturers instructions. 

mRNA profiling by Affymetrix microarray 

Small species RNAs (e.g. miRNA, snRNA, tRNA) were 
removed from a volume of each miRNA-RNA sample 
using Agencourt RNAclean magnetic beads (Beckman- 
Coulter, Brea, CA). The resulting total RNA was profiled 
using Affymetrix Human Genome U133 plus 2.0 Gene- 
Chips® (Affymetrix, Santa Clara, CA). A selection of 
genes with high expression intensities identified by 
microarray were validated by quantitative Real-Time 



PCR (Additional file 2: Supplementary Methods; 
Additional file 2: Figure S4). 

miRNA profiling by ABI TaqMan Low-Density Array (TLDA) 

miRNAs were profiled using TLDA microRNA Cards vl.O 
(Applied Biosystems, Foster City, CA). Single-stranded 
cDNA synthesis from 100 ng of total miRNA-RNA and 
pre-amplification of specific miRNA targets was performed 
according to the manufacturers protocol. The microRNA 
cards were loaded and run on an Applied Biosystems 
7900HT Real-Time PCR system. 

Normalization of microarray and TLDA expression data 

Probe-level summaries of microarray expression data were 
calculated using fRMA, which was implemented in R 
Bioconductor (http://bioconductor.org). The microarray 
expression data and the normalized expression data matrix 
were deposited at the NCBI Gene Expression Omnibus 
(accession number GSE39454). For quantitative RT-PCR, 
data analysis of Ct values was conducted with SDS v2.2.2 
software (Applied Biosystems). All samples were normal- 
ized to the mean Ct value of the endogenous control tran- 
script RNU48. 

Statistical analysis 

All statistical analyses were performed using R (http:// 
www.r-project.org). Two group £-test was applied to 
identify differentially expressed mRNA/miRNAs in myo- 
sitis muscle biopsies compared to the normal controls. 
The empirical Bayes method (from Bioconductor limma 
package) was also employed to identify myositis 
subclass-specific expression patterns [37]. In both of the 
overall and subclass-specific comparisons, the transcripts 
with average fold change > 4 and adjusted j^-value for 
Benjamini-Hochberg < 0.05 are considered significant. 

Invasion model 

Measurements of gene expression from heterogeneous 
samples are typically confounded by the abundance of 
the constituent cell types, which generally can be formu- 
lated as a linear model. In this study, we considered a 
special case of the linear model, where the proportion of 
a certain subset t of cell types increases significantly in a 
certain condition (e.g., the disease state) compared to a 
normal condition. We named this model the invasion 
model On the basis of some simple interpolation (see 
Additional file 2: Supplementary Methods for details), 
the invasion model characterizes several valuable out- 
comes when the special case holds: 1) variations of the 
gene expression will result predominantly from the 
alterations in proportions of the cell type t; 2) the t cell 
type-specific gene will be co-overexpressed even if there 
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is either no or negative correlation between gene expres- 
sion; 3) the average of the fold changes of the t cell type- 
specific gene expression is a good indicator of the 
increase in the fraction of the cell type t in a state change, 
for example, from the normal state to a disease state. The 
portions of invading cell types (or subsets) may change in 
distinct patterns and thus multiple cell type-specific gene 
signatures will be identified in that case. 

The invasion model can be readily applied to myositis to 
assess the leukocyte infiltration into the disease site. Abun- 
dance of leukocytes is low in normal skeletal muscles and 
may increase dramatically in inflammatory myopathies. 
Given the existence of some mRNAs and/or miRNAs pre- 
dominantly expressed in leukocytes, we should observe 
strong concordance among the overexpression of those 
particular protein-coding and/or non-protein coding tran- 
scripts. In practice, constant expression of those transcripts 
is not a requisite in the invasion model as long as the vari- 
ation of the expression is minor relative to the variation of 
the lymphocyte cell abundance due to the infiltration, 
according to the equation (5) in Additional file 2: Supple- 
mentary Methods. It is anticipated that the leukocyte- 
specific transcripts should display "coherent" overexpression 
compared to the normal muscle biopsies and form one or 
more large clusters. As a result, the overexpression of those 
mRNA/miRNAs could serve as a gene signature to quantify 
the level of the infiltration of one or multiple subtypes of 
dominant leukocytes in myositis muscle. 

Consensus clustering and gene signatures 

A consensus clustering approach was employed to deter- 
mine the proper cluster numbers and the robust mem- 
bers within each cluster by using R clusterCons package 
[40]. Multiple clustering algorithms (including agglom- 
erative nesting clustering (agnes), divisive analysis clus- 
tering (diana), /c-means, partitioning around medoids 
(pam), and hierarchical clustering (hclust)) were applied 
with a bootstrapping approach. The clustering results 
were further merged to identify the consensus clusters 
and their robust members. The membership robustness 
of each gene ranged from 0 to 1, defined as the average 
connectivity between a gene and all other members of 
the cluster [40]. Thus, a robust member, by definition, 
should have a high score (i.e., close to 1). We empirically 
used 0.6 as the cutoff value of membership robustness 
and took any cluster with at least five robust members 
to define the gene signature. For each cluster, a gene sig- 
nature was defined as the median of the fold changes of 
the robust members relative to the normal controls. 

To reduce the scale of the clustering problem and ex- 
clude uninformative genes, we applied the clustering 
only on a subset (i.e., -200) of the differentially 
expressed genes that had the highest expression variance 



across subjects as recommended by the clusterCons 
developers [40]. 

Functional analysis of differentially expressed mRNAs 

The differentially expressed transcripts are annotated in 
multiple ways: Ingenuity Pathway Analysis Tool (IPA; 
Redwood City, CA), hypergeometric enrichment of gene 
ontology (GO) terms, KEGG pathways (using Biocon- 
ductor GOstats package) and the gene sets from 
MSigDB (Molecular Signatures Database, http://www. 
broadinstitute.org/gsea/msigdb/index.jsp). IPA micro- 
RNA Target Filter is also utilized to explore the likely 
interaction between differentially expressed mRNAs and 
miRNAs (see Additional file 2: Supplementary Methods; 
Additional file 8: Table S7). 

C2C12 cell maintenance and transfection 

C2C12 cells were purchased from American Type Culture 
Collection (ATCC, Manassas, VA) and cultivated in 
recommended media at 37°C in a humidified atmosphere 
with 5% C0 2 . Artificial miR-146a mimics (Dharmacon) 
and miR-146a inhibitors (Dharmacon), as well as the 
appropriate scrambled controls, were transfected into 
C2C12 cells with PrimeFect siRNA (Lonza) according to 
manufacturers protocol. 24 hours after transfection, cells 
were stimulated with mouse IFN-alpha (PBL interferon 
source) for 4 hours before RNA was isolated and profiled 
for alterations in IFN-inducible genes. 
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